Dataset statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Number of variables | 8 | 8 |
| Number of observations | 5625 | 5625 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Duplicate rows | 0 | 969 |
| Duplicate rows (%) | 0.0% | 17.2% |
| Total size in memory | 357.2 KiB | 486.2 KiB |
| Average record size in memory | 65.0 B | 88.5 B |
Variable types
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Numeric | 7 | 4 |
| Categorical | 1 | 4 |
| Raw_Feat | Binned_Feat | |
|---|---|---|
churn is highly overall correlated with customer_service_calls | churn is highly overall correlated with customer_service_calls | High Correlation |
customer_service_calls is highly overall correlated with churn | customer_service_calls is highly overall correlated with churn | High Correlation |
churn is highly imbalanced (57.3%) | churn is highly imbalanced (57.3%) | Imbalance |
customer_happiness has unique values | Alert not present in this dataset | Unique |
customer_service_calls has 2813 (50.0%) zeros | customer_service_calls has 2985 (53.1%) zeros | Zeros |
| Alert not present in this dataset | Dataset has 969 (17.2%) duplicate rows | Duplicates |
| Alert not present in this dataset | total_day_charge has 938 (16.7%) zeros | Zeros |
| Alert not present in this dataset | customer_happiness has 686 (12.2%) zeros | Zeros |
Reproduction
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Analysis started | 2024-09-11 11:15:35.996416 | 2024-09-11 11:15:38.779108 |
| Analysis finished | 2024-09-11 11:15:38.774031 | 2024-09-11 11:15:40.224826 |
| Duration | 2.78 seconds | 1.45 second |
| Software version | ydata-profiling vv4.10.0 | ydata-profiling vv4.10.0 |
| Download configuration | config.json | config.json |
total_day_minutes
Real number (ℝ)
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 1503 | 8 |
| Distinct (%) | 26.7% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 902.91342 | 6.4581333 |
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 2200 | 7 |
| Zeros | 46 | 46 |
| Zeros (%) | 0.8% | 0.8% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
Quantile statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 291.2 | 5 |
| Q1 | 654 | 6 |
| median | 903 | 7 |
| Q3 | 1141 | 7 |
| 95-th percentile | 1515 | 7 |
| Maximum | 2200 | 7 |
| Range | 2200 | 7 |
| Interquartile range (IQR) | 487 | 1 |
Descriptive statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Standard deviation | 363.52611 | 0.86462714 |
| Coefficient of variation (CV) | 0.40261459 | 0.1338819 |
| Kurtosis | -0.12038144 | 24.541603 |
| Mean | 902.91342 | 6.4581333 |
| Median Absolute Deviation (MAD) | 244 | 0 |
| Skewness | 0.045615971 | -3.9044124 |
| Sum | 5078888 | 36327 |
| Variance | 132151.24 | 0.74758009 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 46 | 0.8% |
| 926 | 15 | 0.3% |
| 941 | 14 | 0.2% |
| 1071 | 13 | 0.2% |
| 853 | 13 | 0.2% |
| 1022 | 12 | 0.2% |
| 831 | 12 | 0.2% |
| 671 | 12 | 0.2% |
| 945 | 11 | 0.2% |
| 955 | 11 | 0.2% |
| Other values (1493) | 5466 |
| Value | Count | Frequency (%) |
| 7 | 3196 | |
| 6 | 2108 | |
| 5 | 226 | 4.0% |
| 0 | 46 | 0.8% |
| 4 | 37 | 0.7% |
| 3 | 6 | 0.1% |
| 2 | 5 | 0.1% |
| 1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 46 | |
| 6 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 46 | 0.8% |
| 1 | 1 | < 0.1% |
| 2 | 5 | 0.1% |
| 3 | 6 | 0.1% |
| 4 | 37 | 0.7% |
| 5 | 226 | 4.0% |
| 6 | 2108 | |
| 7 | 3196 |
| Value | Count | Frequency (%) |
| 0 | 46 | 0.8% |
| 1 | 1 | < 0.1% |
| 2 | 5 | 0.1% |
| 3 | 6 | 0.1% |
| 4 | 37 | 0.7% |
| 5 | 226 | 4.0% |
| 6 | 2108 | |
| 7 | 3196 |
| Value | Count | Frequency (%) |
| 0 | 46 | |
| 6 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
total_day_charge
Real number (ℝ)
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 3600 | 6 |
| Distinct (%) | 64.0% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 44.653449 | 2.4995556 |
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 98.41 | 5 |
| Zeros | 4 | 938 |
| Zeros (%) | 0.1% | 16.7% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
Quantile statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 19.47 | 0 |
| Q1 | 34.53 | 1 |
| median | 44.64 | 2 |
| Q3 | 54.71 | 4 |
| 95-th percentile | 68.914 | 5 |
| Maximum | 98.41 | 5 |
| Range | 98.41 | 5 |
| Interquartile range (IQR) | 20.18 | 3 |
Descriptive statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Standard deviation | 15.041857 | 1.708081 |
| Coefficient of variation (CV) | 0.33685768 | 0.68335388 |
| Kurtosis | -0.036956693 | -1.2684478 |
| Mean | 44.653449 | 2.4995556 |
| Median Absolute Deviation (MAD) | 10.09 | 1 |
| Skewness | 0.036399543 | 0.00065149344 |
| Sum | 251175.65 | 14060 |
| Variance | 226.25746 | 2.9175407 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 40.81 | 6 | 0.1% |
| 48.04 | 6 | 0.1% |
| 56.03 | 6 | 0.1% |
| 42.73 | 6 | 0.1% |
| 43.96 | 5 | 0.1% |
| 44.8 | 5 | 0.1% |
| 57.66 | 5 | 0.1% |
| 39.88 | 5 | 0.1% |
| 58.81 | 5 | 0.1% |
| 50.76 | 5 | 0.1% |
| Other values (3590) | 5571 |
| Value | Count | Frequency (%) |
| 2 | 939 | |
| 5 | 938 | |
| 0 | 938 | |
| 3 | 937 | |
| 1 | 937 | |
| 4 | 936 |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1.02 | 1 | < 0.1% |
| 1.13 | 1 | < 0.1% |
| 2.22 | 1 | < 0.1% |
| 2.87 | 1 | < 0.1% |
| 3.19 | 1 | < 0.1% |
| 3.77 | 2 | |
| 3.8 | 1 | < 0.1% |
| 4.08 | 1 | < 0.1% |
| 4.71 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 938 | |
| 1 | 937 | |
| 2 | 939 | |
| 3 | 937 | |
| 4 | 936 | |
| 5 | 938 |
| Value | Count | Frequency (%) |
| 0 | 938 | |
| 1 | 937 | |
| 2 | 939 | |
| 3 | 937 | |
| 4 | 936 | |
| 5 | 938 |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1.02 | 1 | < 0.1% |
| 1.13 | 1 | < 0.1% |
| 2.22 | 1 | < 0.1% |
| 2.87 | 1 | < 0.1% |
| 3.19 | 1 | < 0.1% |
| 3.77 | 2 | |
| 3.8 | 1 | < 0.1% |
| 4.08 | 1 | < 0.1% |
| 4.71 | 1 | < 0.1% |
total_eve_minutes
Numeric
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 977 | 4 |
| Distinct (%) | 17.4% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
| 695 | 20 |
|---|---|
| 604 | 20 |
| 648 | 19 |
| 776 | 19 |
| 497 | 18 |
| Other values (972) |
| 1 | |
|---|---|
| 2 | |
| 0 | |
| 3 | 109 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5625 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Unique | 164 | 0 ? |
| Unique (%) | 2.9% | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 695 | 20 | 0.4% |
| 604 | 20 | 0.4% |
| 648 | 19 | 0.3% |
| 776 | 19 | 0.3% |
| 497 | 18 | 0.3% |
| 556 | 18 | 0.3% |
| 566 | 17 | 0.3% |
| 435 | 17 | 0.3% |
| 0 | 17 | 0.3% |
| 564 | 17 | 0.3% |
| Other values (967) | 5443 |
| Value | Count | Frequency (%) |
| 1 | 3149 | |
| 2 | 1719 | |
| 0 | 648 | 11.5% |
| 3 | 109 | 1.9% |
Length
Histogram of lengths of the category
Common Values (Plot)
Raw_Feat
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)Binned_Feat
| Value | Count | Frequency (%) |
| 1 | 3149 | |
| 2 | 1719 | |
| 0 | 648 | 11.5% |
| 3 | 109 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3149 | |
| 2 | 1719 | |
| 0 | 648 | 11.5% |
| 3 | 109 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3149 | |
| 2 | 1719 | |
| 0 | 648 | 11.5% |
| 3 | 109 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3149 | |
| 2 | 1719 | |
| 0 | 648 | 11.5% |
| 3 | 109 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3149 | |
| 2 | 1719 | |
| 0 | 648 | 11.5% |
| 3 | 109 | 1.9% |
total_eve_calls
Numeric
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 315 | 5 |
| Distinct (%) | 5.6% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
| 126 | 48 |
|---|---|
| 165 | 46 |
| 167 | 46 |
| 190 | 45 |
| 177 | 44 |
| Other values (310) |
| 1 | |
|---|---|
| 2 | |
| 0 | |
| 3 | |
| 4 | 25 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5625 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Unique | 27 | 0 ? |
| Unique (%) | 0.5% | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 126 | 48 | 0.9% |
| 165 | 46 | 0.8% |
| 167 | 46 | 0.8% |
| 190 | 45 | 0.8% |
| 177 | 44 | 0.8% |
| 144 | 44 | 0.8% |
| 158 | 43 | 0.8% |
| 157 | 43 | 0.8% |
| 174 | 43 | 0.8% |
| 140 | 42 | 0.7% |
| Other values (305) | 5181 |
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 2 | 2173 | |
| 0 | 627 | 11.1% |
| 3 | 448 | 8.0% |
| 4 | 25 | 0.4% |
Length
Histogram of lengths of the category
Common Values (Plot)
Raw_Feat
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)Binned_Feat
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 2 | 2173 | |
| 0 | 627 | 11.1% |
| 3 | 448 | 8.0% |
| 4 | 25 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 2 | 2173 | |
| 0 | 627 | 11.1% |
| 3 | 448 | 8.0% |
| 4 | 25 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 2 | 2173 | |
| 0 | 627 | 11.1% |
| 3 | 448 | 8.0% |
| 4 | 25 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 2 | 2173 | |
| 0 | 627 | 11.1% |
| 3 | 448 | 8.0% |
| 4 | 25 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 2 | 2173 | |
| 0 | 627 | 11.1% |
| 3 | 448 | 8.0% |
| 4 | 25 | 0.4% |
customer_service_rating
Numeric
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 11 | 4 |
| Distinct (%) | 0.2% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
| 8 | |
|---|---|
| 7 | |
| 10 | |
| 9 | |
| 6 | |
| Other values (6) |
| 3 | |
|---|---|
| 2 | |
| 1 | |
| 0 | 38 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5625 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 8 | 1084 | |
| 7 | 1026 | |
| 10 | 900 | |
| 9 | 896 | |
| 6 | 822 | |
| 5 | 489 | |
| 4 | 265 | 4.7% |
| 3 | 105 | 1.9% |
| 2 | 25 | 0.4% |
| 1 | 10 | 0.2% |
| Value | Count | Frequency (%) |
| 3 | 2880 | |
| 2 | 1848 | |
| 1 | 859 | 15.3% |
| 0 | 38 | 0.7% |
Length
Histogram of lengths of the category
Common Values (Plot)
Raw_Feat
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)Binned_Feat
| Value | Count | Frequency (%) |
| 3 | 2880 | |
| 2 | 1848 | |
| 1 | 859 | 15.3% |
| 0 | 38 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2880 | |
| 2 | 1848 | |
| 1 | 859 | 15.3% |
| 0 | 38 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 2880 | |
| 2 | 1848 | |
| 1 | 859 | 15.3% |
| 0 | 38 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 2880 | |
| 2 | 1848 | |
| 1 | 859 | 15.3% |
| 0 | 38 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 2880 | |
| 2 | 1848 | |
| 1 | 859 | 15.3% |
| 0 | 38 | 0.7% |
customer_happiness
Real number (ℝ)
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 5625 | 8 |
| Distinct (%) | 100.0% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.50466052 | 3.5393778 |
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0.00017939101 | 0 |
| Maximum | 0.99986871 | 7 |
| Zeros | 0 | 686 |
| Zeros (%) | 0.0% | 12.2% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
Quantile statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0.00017939101 | 0 |
| 5-th percentile | 0.053614748 | 0 |
| Q1 | 0.25309437 | 2 |
| median | 0.51056577 | 4 |
| Q3 | 0.75766995 | 6 |
| 95-th percentile | 0.95141039 | 7 |
| Maximum | 0.99986871 | 7 |
| Range | 0.99968932 | 7 |
| Interquartile range (IQR) | 0.50457558 | 4 |
Descriptive statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Standard deviation | 0.29066198 | 2.3058898 |
| Coefficient of variation (CV) | 0.57595545 | 0.65149583 |
| Kurtosis | -1.2269888 | -1.2546137 |
| Mean | 0.50466052 | 3.5393778 |
| Median Absolute Deviation (MAD) | 0.25278963 | 2 |
| Skewness | -0.0096845972 | -0.011237124 |
| Sum | 2838.7154 | 19909 |
| Variance | 0.084484386 | 5.317128 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.183440548 | 1 | < 0.1% |
| 0.1182925103 | 1 | < 0.1% |
| 0.5159505222 | 1 | < 0.1% |
| 0.6489132417 | 1 | < 0.1% |
| 0.5419121075 | 1 | < 0.1% |
| 0.6339349819 | 1 | < 0.1% |
| 0.582825159 | 1 | < 0.1% |
| 0.9452413572 | 1 | < 0.1% |
| 0.8098898999 | 1 | < 0.1% |
| 0.6851322974 | 1 | < 0.1% |
| Other values (5615) | 5615 |
| Value | Count | Frequency (%) |
| 7 | 752 | |
| 2 | 738 | |
| 4 | 710 | |
| 5 | 705 | |
| 6 | 698 | |
| 1 | 696 | |
| 0 | 686 | |
| 3 | 640 |
| Value | Count | Frequency (%) |
| 0.0001793910098 | 1 | |
| 0.0002135787151 | 1 | |
| 0.0004282800256 | 1 | |
| 0.0007159080322 | 1 | |
| 0.001564124626 | 1 | |
| 0.001667041497 | 1 | |
| 0.001753145611 | 1 | |
| 0.002242134096 | 1 | |
| 0.002280608135 | 1 | |
| 0.002336968566 | 1 |
| Value | Count | Frequency (%) |
| 0 | 686 | |
| 1 | 696 | |
| 2 | 738 | |
| 3 | 640 | |
| 4 | 710 | |
| 5 | 705 | |
| 6 | 698 | |
| 7 | 752 |
| Value | Count | Frequency (%) |
| 0 | 686 | |
| 1 | 696 | |
| 2 | 738 | |
| 3 | 640 | |
| 4 | 710 | |
| 5 | 705 | |
| 6 | 698 | |
| 7 | 752 |
| Value | Count | Frequency (%) |
| 0.0001793910098 | 1 | |
| 0.0002135787151 | 1 | |
| 0.0004282800256 | 1 | |
| 0.0007159080322 | 1 | |
| 0.001564124626 | 1 | |
| 0.001667041497 | 1 | |
| 0.001753145611 | 1 | |
| 0.002242134096 | 1 | |
| 0.002280608135 | 1 | |
| 0.002336968566 | 1 |
customer_service_calls
Real number (ℝ)
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 94 | 6 |
| Distinct (%) | 1.7% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 11.996978 | 1.5424 |
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 111 | 5 |
| Zeros | 2813 | 2985 |
| Zeros (%) | 50.0% | 53.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 87.9 KiB | 216.9 KiB |
Quantile statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 0 | 0 |
| Q1 | 0 | 0 |
| median | 0 | 0 |
| Q3 | 20 | 3 |
| 95-th percentile | 50 | 4 |
| Maximum | 111 | 5 |
| Range | 111 | 5 |
| Interquartile range (IQR) | 20 | 3 |
Descriptive statistics
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Standard deviation | 17.533087 | 1.7869319 |
| Coefficient of variation (CV) | 1.4614586 | 1.1585398 |
| Kurtosis | 2.5048174 | -1.3635539 |
| Mean | 11.996978 | 1.5424 |
| Median Absolute Deviation (MAD) | 0 | 0 |
| Skewness | 1.6574438 | 0.52589399 |
| Sum | 67483 | 8676 |
| Variance | 307.40913 | 3.1931255 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| 1 | 100 | 1.8% |
| 3 | 89 | 1.6% |
| 6 | 88 | 1.6% |
| 15 | 80 | 1.4% |
| 5 | 80 | 1.4% |
| 7 | 72 | 1.3% |
| 2 | 72 | 1.3% |
| 12 | 71 | 1.3% |
| 10 | 71 | 1.3% |
| Other values (84) | 2089 |
| Value | Count | Frequency (%) |
| 0 | 2985 | |
| 4 | 947 | 16.8% |
| 3 | 835 | 14.8% |
| 2 | 445 | 7.9% |
| 5 | 270 | 4.8% |
| 1 | 143 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| 1 | 100 | 1.8% |
| 2 | 72 | 1.3% |
| 3 | 89 | 1.6% |
| 4 | 54 | 1.0% |
| 5 | 80 | 1.4% |
| 6 | 88 | 1.6% |
| 7 | 72 | 1.3% |
| 8 | 69 | 1.2% |
| 9 | 65 | 1.2% |
| Value | Count | Frequency (%) |
| 0 | 2985 | |
| 1 | 143 | 2.5% |
| 2 | 445 | 7.9% |
| 3 | 835 | 14.8% |
| 4 | 947 | 16.8% |
| 5 | 270 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 2985 | |
| 1 | 143 | 2.5% |
| 2 | 445 | 7.9% |
| 3 | 835 | 14.8% |
| 4 | 947 | 16.8% |
| 5 | 270 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 2813 | |
| 1 | 100 | 1.8% |
| 2 | 72 | 1.3% |
| 3 | 89 | 1.6% |
| 4 | 54 | 1.0% |
| 5 | 80 | 1.4% |
| 6 | 88 | 1.6% |
| 7 | 72 | 1.3% |
| 8 | 69 | 1.2% |
| 9 | 65 | 1.2% |
churn
Categorical
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 49.6 KiB | 178.6 KiB |
| 0 | |
|---|---|
| 1 | 490 |
| 0 | |
|---|---|
| 1 | 490 |
Length
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Max length | 1 | 1 |
| Median length | 1 | 1 |
| Mean length | 1 | 1 |
| Min length | 1 | 1 |
Characters and Unicode
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Total characters | 5625 | 5625 |
| Distinct characters | 2 | 2 |
| Distinct categories | 1 | 1 ? |
| Distinct scripts | 1 | 1 ? |
| Distinct blocks | 1 | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Raw_Feat | Binned_Feat | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Raw_Feat | Binned_Feat | |
|---|---|---|
| 1st row | 0 | 0 |
| 2nd row | 0 | 0 |
| 3rd row | 0 | 0 |
| 4th row | 1 | 1 |
| 5th row | 0 | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
Length
Histogram of lengths of the category
Common Values (Plot)
Raw_Feat
Binned_Feat
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
| Value | Count | Frequency (%) |
| (unknown) | 5625 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 5135 | |
| 1 | 490 | 8.7% |
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Interaction plot not present for dataset
Raw_Feat
Binned_Feat
Raw_Feat
Binned_Feat
Raw_Feat
| churn | customer_happiness | customer_service_calls | customer_service_rating | total_day_charge | total_day_minutes | total_eve_calls | total_eve_minutes | |
|---|---|---|---|---|---|---|---|---|
| churn | 1.000 | 0.239 | 0.589 | 0.221 | 0.000 | 0.018 | 0.007 | 0.020 |
| customer_happiness | 0.239 | 1.000 | -0.005 | -0.025 | 0.013 | 0.024 | 0.014 | 0.002 |
| customer_service_calls | 0.589 | -0.005 | 1.000 | 0.027 | 0.003 | -0.004 | -0.001 | -0.004 |
| customer_service_rating | 0.221 | -0.025 | 0.027 | 1.000 | -0.001 | -0.007 | -0.002 | -0.033 |
| total_day_charge | 0.000 | 0.013 | 0.003 | -0.001 | 1.000 | 0.021 | -0.027 | -0.004 |
| total_day_minutes | 0.018 | 0.024 | -0.004 | -0.007 | 0.021 | 1.000 | -0.012 | 0.001 |
| total_eve_calls | 0.007 | 0.014 | -0.001 | -0.002 | -0.027 | -0.012 | 1.000 | -0.007 |
| total_eve_minutes | 0.020 | 0.002 | -0.004 | -0.033 | -0.004 | 0.001 | -0.007 | 1.000 |
Binned_Feat
| churn | customer_happiness | customer_service_calls | customer_service_rating | total_day_charge | total_day_minutes | total_eve_calls | total_eve_minutes | |
|---|---|---|---|---|---|---|---|---|
| churn | 1.000 | 0.236 | 0.611 | 0.221 | 0.017 | 0.037 | 0.024 | 0.020 |
| customer_happiness | 0.236 | 1.000 | -0.005 | 0.025 | 0.013 | 0.013 | 0.004 | 0.021 |
| customer_service_calls | 0.611 | -0.005 | 1.000 | 0.013 | -0.000 | -0.008 | 0.000 | 0.000 |
| customer_service_rating | 0.221 | 0.025 | 0.013 | 1.000 | 0.000 | 0.000 | 0.024 | 0.000 |
| total_day_charge | 0.017 | 0.013 | -0.000 | 0.000 | 1.000 | 0.020 | 0.014 | 0.007 |
| total_day_minutes | 0.037 | 0.013 | -0.008 | 0.000 | 0.020 | 1.000 | 0.010 | 0.023 |
| total_eve_calls | 0.024 | 0.004 | 0.000 | 0.024 | 0.014 | 0.010 | 1.000 | 0.031 |
| total_eve_minutes | 0.020 | 0.021 | 0.000 | 0.000 | 0.007 | 0.023 | 0.031 | 1.000 |
Raw_Feat
A simple visualization of nullity by column.
Binned_Feat
A simple visualization of nullity by column.
Raw_Feat
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Binned_Feat
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Raw_Feat
| total_day_minutes | total_day_charge | total_eve_minutes | total_eve_calls | customer_service_rating | customer_happiness | customer_service_calls | churn | |
|---|---|---|---|---|---|---|---|---|
| 497788 | 395.0 | 60.53 | 608.0 | 61 | 9 | 0.183441 | 0 | 0 |
| 69163 | 782.0 | 49.06 | 610.0 | 144 | 3 | 0.010195 | 0 | 0 |
| 346342 | 1144.0 | 29.49 | 549.0 | 183 | 8 | 0.710498 | 0 | 0 |
| 59207 | 1148.0 | 46.84 | 560.0 | 163 | 10 | 0.040356 | 51 | 1 |
| 682551 | 941.0 | 65.91 | 438.0 | 99 | 10 | 0.260329 | 19 | 0 |
| 537407 | 1123.0 | 37.28 | 510.0 | 243 | 8 | 0.550550 | 6 | 0 |
| 966010 | 1102.0 | 26.83 | 526.0 | 109 | 9 | 0.333661 | 0 | 0 |
| 344765 | 1277.0 | 46.70 | 846.0 | 117 | 6 | 0.012234 | 1 | 0 |
| 406546 | 1496.0 | 51.42 | 417.0 | 182 | 7 | 0.459110 | 39 | 0 |
| 797390 | 823.0 | 50.18 | 765.0 | 240 | 7 | 0.221134 | 9 | 0 |
Binned_Feat
| total_day_minutes | total_day_charge | total_eve_minutes | total_eve_calls | customer_service_rating | customer_happiness | customer_service_calls | churn | |
|---|---|---|---|---|---|---|---|---|
| 497788 | 6 | 5 | 1 | 0 | 3 | 1 | 0 | 0 |
| 69163 | 6 | 3 | 1 | 1 | 1 | 0 | 0 | 0 |
| 346342 | 7 | 0 | 1 | 2 | 3 | 5 | 0 | 0 |
| 59207 | 7 | 3 | 1 | 2 | 3 | 0 | 5 | 1 |
| 682551 | 7 | 5 | 1 | 1 | 3 | 2 | 3 | 0 |
| 537407 | 7 | 1 | 1 | 3 | 3 | 4 | 2 | 0 |
| 966010 | 7 | 0 | 1 | 1 | 3 | 2 | 0 | 0 |
| 344765 | 7 | 3 | 2 | 1 | 2 | 0 | 0 | 0 |
| 406546 | 7 | 4 | 1 | 2 | 2 | 3 | 4 | 0 |
| 797390 | 6 | 3 | 2 | 3 | 2 | 1 | 2 | 0 |
Raw_Feat
| total_day_minutes | total_day_charge | total_eve_minutes | total_eve_calls | customer_service_rating | customer_happiness | customer_service_calls | churn | |
|---|---|---|---|---|---|---|---|---|
| 472777 | 1073.0 | 67.53 | 802.0 | 258 | 10 | 0.895480 | 0 | 0 |
| 440487 | 937.0 | 48.28 | 489.0 | 162 | 9 | 0.672675 | 37 | 0 |
| 243139 | 583.0 | 23.73 | 452.0 | 153 | 10 | 0.543898 | 0 | 0 |
| 557988 | 485.0 | 35.58 | 637.0 | 186 | 7 | 0.440868 | 0 | 0 |
| 857974 | 945.0 | 35.05 | 514.0 | 154 | 10 | 0.407123 | 0 | 0 |
| 914712 | 909.0 | 34.97 | 405.0 | 93 | 9 | 0.250214 | 0 | 0 |
| 559243 | 606.0 | 63.27 | 987.0 | 171 | 5 | 0.949713 | 0 | 0 |
| 77144 | 584.0 | 36.39 | 641.0 | 84 | 8 | 0.448140 | 13 | 0 |
| 191612 | 1336.0 | 24.68 | 713.0 | 217 | 9 | 0.003229 | 0 | 1 |
| 650330 | 345.0 | 33.16 | 625.0 | 253 | 6 | 0.697006 | 0 | 0 |
Binned_Feat
| total_day_minutes | total_day_charge | total_eve_minutes | total_eve_calls | customer_service_rating | customer_happiness | customer_service_calls | churn | |
|---|---|---|---|---|---|---|---|---|
| 472777 | 7 | 5 | 2 | 3 | 3 | 7 | 0 | 0 |
| 440487 | 7 | 3 | 1 | 2 | 3 | 5 | 4 | 0 |
| 243139 | 6 | 0 | 1 | 1 | 3 | 4 | 0 | 0 |
| 557988 | 6 | 1 | 1 | 2 | 2 | 3 | 0 | 0 |
| 857974 | 7 | 1 | 1 | 1 | 3 | 3 | 0 | 0 |
| 914712 | 7 | 1 | 1 | 1 | 3 | 2 | 0 | 0 |
| 559243 | 6 | 5 | 2 | 2 | 1 | 7 | 0 | 0 |
| 77144 | 6 | 1 | 1 | 1 | 3 | 3 | 3 | 0 |
| 191612 | 7 | 0 | 2 | 2 | 3 | 0 | 0 | 1 |
| 650330 | 6 | 1 | 1 | 3 | 2 | 5 | 0 | 0 |
Raw_Feat
| total_day_minutes | total_day_charge | total_eve_minutes | total_eve_calls | customer_service_rating | customer_happiness | customer_service_calls | churn | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | |||||||||
Binned_Feat
| total_day_minutes | total_day_charge | total_eve_minutes | total_eve_calls | customer_service_rating | customer_happiness | customer_service_calls | churn | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|
| 402 | 7 | 0 | 1 | 1 | 3 | 0 | 0 | 0 | 10 |
| 893 | 7 | 5 | 1 | 1 | 3 | 0 | 0 | 0 | 10 |
| 734 | 7 | 3 | 1 | 2 | 3 | 7 | 0 | 0 | 9 |
| 99 | 6 | 1 | 1 | 2 | 2 | 2 | 0 | 0 | 7 |
| 146 | 6 | 2 | 1 | 1 | 3 | 1 | 0 | 0 | 7 |
| 277 | 6 | 4 | 1 | 1 | 3 | 2 | 0 | 0 | 7 |
| 397 | 7 | 0 | 1 | 1 | 2 | 2 | 0 | 0 | 7 |
| 420 | 7 | 0 | 1 | 2 | 2 | 3 | 0 | 0 | 7 |
| 431 | 7 | 0 | 1 | 2 | 3 | 6 | 0 | 0 | 7 |
| 492 | 7 | 1 | 1 | 1 | 3 | 1 | 0 | 0 | 7 |